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Abstract —In this paper we study the problem of learning from 
multiple modal data for purpose of document classification. In 
this problem, each document is composed two different modals of 
data, i.e., an image and a text. Cross-modal factor analysis (CFA) 
has been proposed to project the two different modals of data to 
a shared data space, so that the classification of a image or a text 
can be performed directly in this space. A disadvantage of CFA 
is that it has ignored the supervision information. In this paper, 
we Improve CFA by incorporating the supervision information to 
represent and classify both image and text modals of documents. 
We project both image and text data to a shared data space 
by factor analysis, and then train a class label predictor in 
the shared space to use the class label information. The factor 
analysis parameter and the predictor parameter are learned 
jointly by solving one single objective function. With this objective 
function, we minimize the distance between the projections of 
image and text of the same document, and the classification error 
of the projection measured by hinge loss function. The objective 
function is optimized by an alternate optimization strategy in an 
iterative algorithm. Experiments in two different multiple modal 
document data sets show the advantage of the proposed algorithm 
over other CFA methods. 

Index Terms —Multiple modal learning. Cross-modal factor 
analysis. Supervised learning 

I. Introduction 

In this paper, we deal with the problem of learning from 
multiple modal data ID, 12. Traditional data representation, 
classification and retrieval problems usually focus on single 
modal data 0, 0. For example, for the problem of text 
classification, we usually only consider using a data set of text 
to train a classifier 0, 0. While for the problem of image 
representation, only the images are considered to learn the 
representation parameters Q, 0. Zhang et al. developed and 
optimized association rule mining algorithms and implemented 
them on paralleled micro-architectural platforms 13, M- 


However, in modern information landscape, the data is usually 
composed of different modals. For example, in video clips, 
there are two modals of data, e.g. image sequence data and 
audio data. Moreover, in a research article, there are not 
only the text data, but also the image data. Learning from 
these multiple modal data has attracted much attention from 
both machine learning and multimedia information processing 
communities CD, C2. Recently, cross-modal factor analysis 
(CFA) has been proposed to project different modal of data 
to a shared feature space so that classification or retrieval 
can be performed cross data modal CD- It assumes that each 
document is composed of two modals of data, e.g., image and 
text, and try to learn a projection matrix for each modal, so 
that the projections of the two modals of a document can be 
as close to each other as possible. With these projections, we 
can project any data of one of the two modals to the shared 
data space, and then perform retrieval cross these two modals. 
Moreover, since the data of different modals can be projected 
to a shared data space, we can also train a classifier from both 
the modals for the classification of data of any modal. CFA is 
further extended to its kernel version in m. 

In this paper, we consider the problem of learning a 
classifier from multiple modal data, and using the classifier 
to classify a single modal data. In a training set of documents, 
each document is composed of data of two modals, e.g., an 
image and a text. We can first apply CFA to project two modals 
to a shared space and then learn a classifier in this space by 
using the class label information. However, in the phase of 
projection, the class label information has been ignored by 
CFA. Actually, without using supervision information provided 
by class label information, it cannot be guaranteed that the 
projections are discriminative enough. Although we can apply 


powerful classification methods after the projection of multiple 
modal data, the discriminative information which is necessary 
for classification may has been lost during the projection 
procedure. Thus it is very necessary to include the available 
class label information in the CFA projection. Surprisingly, no 
work has been done to incorporate the supervision information 
contained by the class labels to improve the discriminative 
ability of CFA. To fill this gap, in this paper, we proposed the 
first supervised CFA method by regularizing the projections of 
different modals by the class label information. 

The contributions of this paper are of two folds: 

1) For the first time, we propose the formulation of super¬ 
vised CFA. We propose to project the data of two modals 
to a shared data space by orthonormal transformations, 
and use a linear class label function to predict the 
class labels of the training multiple modal documents. 
To formulate the problem, we propose to minimize 
the difference between the projections of two modals 
of the same document, and simultaneously minimize 
the classification error of both modals measured by 
hinge loss. In this way, the learning of orthonormal 
transformation matrices of multiple modal projections 
and the class label predictor parameter are unified, 
and the predictor learning can regularize the learning 
of orthonormal transformation matrices to improve the 
discriminative ability of the multiple modal projections. 

2) We also develop an iterative algorithm to optimize the 
constrained minimization problem of this formulation. 
The projection parameter and the predictor parameter 
are optimized alternately in an iterative algorithm. The 
orthonormal transformation matrices are optimized by 
fixing class label predictor parameter matrix and solving 
a singular value decomposition (SVD) problem lEl, 
m. The class label predictor parameter matrix is solved 
by fixing orthonormal transformation matrices and solv¬ 
ing a quadratic programming (QP) problem. lEl, ESI 

The remaining of this paper is organized as follows: in sec¬ 
tion |II] we introduce the proposed supervised CFA (SupCFA), 
in section Hn] the proposed method is evaluated experimentally, 
and in section |IV] the paper is concluded. 

II. Proposed method 
A. Problem formulation 

We assume that we have a training set of n documents V — 
{Di, • • ■ , Dn}, where Di is the i-th document. Each document 
is comprised of an image component and an accompanying 
text, i.e., Di = {li^Tf), where li G is a d/-dimensional 
feature space of the image, Ti G is a a dr-dimensional 
feature of the text. These documents are assumed to belongs to 
m classes, and for the i-th document, we define a class label 
vector = [t/ii,-" ,2/™] S {-fl,-!}"* to indicate which 
class it belongs to. The j-th dimension of y,, = -1-1 if Di 

belongs to the j-th class, and j/y = — 1 otherwise. The goal 
of cross-modal classification is to learn a predictor from the 
training set, and use the predictor to predict the class label 
vector of given text (image) query T (/). 


To this end, we first project images and texts of documents 
to a shared d-dimensional feature space by orthonormal trans¬ 
formations, and then learn a linear prediction function in the 
share space to predict the class label vector. The projection in 
the image and text space are given as follows: 


h hrii G T, ^ G (1) 

where 17/ G orthonormal transformation matrix 

of the image data, and S j^^Txd orthonormal 

transformation matrix of the text data. CFA assumes that the 
projections of the image and text of a single document should 
be as close to each other as possible, and the squared £2 norm 
distance between liflj and Tiflr is minimized over all the 
training documents. 


mm ^ ||17//,-17TTi||^ 

ilf ,ilT . 


S.t. 17/ 17/ — £dxd^ — £dx.di 


( 2 ) 


where Idxd ts a dj x dj identity matrix. To predict the class 
label vector y^ from the projections of image and text of the 
i-th document, we try to learn a linear function as follows. 


Yi ^ = (/il7/)VF, 

y, ^ fwimr) = {VLtT,)W, ^ ’ 

where VF G is the predictor parameter matrix. To learn 

W, we minimize its squared I 2 norm and the hinge loss of 
the predictor over both the images and texts of all the training 
documents. 


min WWl + CiY^iS^i+ei) 
w ^' 

(4) 

s.t. h - {I,Vli)WyJ < ^i, > 0, 

h - {nTTi)WyJ <ei,ei>0,i = l,---,n. 

where ^i is the slack variable of the hinge loss over the image 
of the /-th document, £i is that of the text of the i-th document, 
is a parameter of the hinge loss function, and Ci is a 
tradeoff parameter. The minimization of || VFlU is to reduce the 
complexity of the predictor and also to seek a large margin. 
The hinge loss is applied to reduce the prediction error. 

The formulation of the proposed method is the combination 
of (|2]i and (|4|, which is as follows. 


min 

W.Qi.Qt 

S.t. 


^I|W"II2 + Cl ^ 

h-imi)WyJ > 0 , 

h - {TinT)WyJ < £*, e* > 0, i = 1, • • • ,n, 


2 

2 


17/ 17/ — Idxd, 17/117/’ — Idxdy 

(5) 

where C 2 is another tradeoff parameter. Please note that in this 
formulation, not only the orthonormal transformation matrices 
are variables, but also the predictor parameter. In this way, the 


representation and the classification of image and text modals 
are unified. Both the images and texts are mapped to a shared 
space and then a shared predictor are applied to classify them. 


B. Optimization 


To solve the problem in (|5]), we write the Lagrange function 
as follows. 


C =i||VL||2 + Cl + c) + ^2 ^ ||/,L!/ - mrWl 

i=l 

n n 

+ ^ ai (h - - Ci) “ X! 

i—1 i—1 

n n 

+ X! “ {Ti^T)WyJ - Ei) - X! 

i=l i=l 

- Tr (PT -Ia^a))-Tr (A^ {nlflT - Idxd)) , 

( 6 ) 

where ai, Pi, 7 i, 5i,i = I,-- - ,n, L and A are Lagrange 
multiplier variables. To seek the minimization of the objective, 
we set the partial derivative of C with regard to W, and 
to zero respectively. 


„ . n n 

— = w-Y, - E = 0, 

i—1 i—1 

n n 


i=l 


i=l 


d^i 

de. 


= Cl — tti — pi = 0 ^ Cl — at = Pi > 0 ^ ai < Cl 


= Cl — 7i — i5i = 0 => Cl — 7i = > 0 =7 7i < Cl. 


By substituting (|7]i to (|6]l, we have 


(7) 


n 

= "2 E (njilyjji.ni) 

ij=l 

n 

+ E ai-ijTr{VliIiy]y^TjVtT) 

i,3 = t 

1 " 

- 2 E y.yJr.OT) 

* j=i 

n n 

+ C 2 E (^7 IjI^^i) - 2 C 2 E (^7 IjTiflr) + 

i—1 i—1 

n 

C 2 E {^TTjTinr) , and 

i^l 

9{ai\^=i,ji\^i,nj,nT,A,T) 

n 

= /iE(a* +7i) 

i^l 

- Tr (r^ {n]ni - /dxd)) - Tr (A^ (nfrir - h^d)) ■ 

(9) 

The optimization problem is transferred to the following 
coupled problem. 


max min C, 

“d"=ii7»l”=iirA 

s.t. 0 < CTi < Cl, 0 < 7 i < Cl, f = 1, • • • , n, 

r > 0, A > 0. 


( 10 ) 

To solve this problem, we employ the alternate optimization 
strategy. We optimize 7 i|",^i and T, A,r2/,nT alter¬ 

nately in an iterative algorithm. In each iteration, we first 
fix r. A, 17/, fly and optimize Q!i|7^i, 7 i|”=i, and then we fix 
ailr=i,7i|?=i and optimize L, A,n/,f7T- 

1) Optimizing Q!i|"^i and 7i|"=i.' Fixing F, A, Qj and Qt, 
and removing the terms irrelevant to ailiLi and 7 i|”=i from 
([8]l, we rewrite ( fTOl i as 


1 , 

“ 9 E {njIj y,yj7^17/) 

1,1=1 

n 

+ E Oii-ijTr{VLiIiyJyjTjVLT) 


i,J=l 

1 ” (11) 
- 2 E 7C.Te yiyiT.nr) 


ij=l 

n 

+ h'^{ai + 7i) 

i^l 

si. 0 < < (71,0 < 7 i < Ci,i = 1, • • * ,n. 


T =/(a,|r=i, 7zir=i, TLt) + fl(a,|"=i,7^ir=i, A,r) 


( 8 ) 


This problem can be solved as a QP problem. 

2) Optimizing f7/ and VLt: By hxing Q;i|7^i and 7 i|"^i, 
and removing terms irrelevant to F, A, 17/ and 17 t from ®, 
we rewrite (doll as 


where 


n 


(15) 


max min 
r.A Qi,Q,t 


1 

2 


o-iajTr (nJlJyjJljni) 

*j'=i 


+ ^ aajTrinJlJyjjT.nr) 


1 

- 2 E {nWy^yjT.^T) 

i,j=l 

n 

+ C2^rr(f27j7/,f27) 

2^1 

n 

-2C2^Tr 

z^l 

n 

+ ^Tr (rifT^T^riT) 

Z^l 

-Tr (r^ {njni-h^d)) 

-Tr (A^ (r!jOT-/dxd)), 

s.i. r > 0, A > 0. 

The primal problem of this dual problem is 


( 12 ) 


where 


max Tr (jljZ^I t) 

Q/ ,Qx’ 

S.t. nj Qj = Idxd, = Idxd- 


n n 

z=J2 oi^l3ily^y]T, - 2C2 (16) 

i,j—l 2=1 

The optimal matrices 12/ and Q,t can be obtained by a SVD 
of the matrix Z, i.e., 


Z = 12/S12J (17) 

where S is the matrix of singular values of Z, and S is 
diagonal. 

C. Algorithm 

Based on the optimization results, we can design an iterative 
algorithm as Algorithm [T] The iterations are repeated T times, 
and the f-th each iteration, the variables 12j, 12^, a\\2^i and 
are updated alternately. Finally the predictor parameter 
W is calculated from the updated variables. 


1 . ' 

*j=i 

n 

+ E (^i/^7yiyjT’ji2r) 

ij'=l 
1 " 

- 2 E yj^T.^T) 

* J=1 

n n 

+ C 2 ^ Tr (127 17I^^I) - 2^2 ^ Tr {njij T,12t) 

Z=1 2=1 

n 

TY.Tr (127t;^T,12t) 

2=1 

s.t. Vtj VLj = Idxd, TI'yTIt — Idxd- 

(13) 

Using the constrains 12712/ = Idxd and 12712 t — Idxd, we 
can remove some constant terms from ( fT3l l and rewrite it as 


Algorithm 1 Iterative learning algorithm of supervised cross- 
modal factor analysis. 

Input: A training set of n documents 

{(/i,Ti), •••,(/„,T„)}, and corresponding class label 
vector set {yi, • • ■ ,y„}; 

Input: Tradeoff parameters Ci, C 2 , and maximum iteration 
number T; 

Initialize orthonormal transformation matrices llj and 127; 

for 2 = 1, • • • , T do 

Fix 12j“^ and 12^^, and update and by 

solving the problem in (fTTT i: 

Fix and , and update Z* as in (fT^ : 

Update 12j and 127 I’Y ^PPlying SVD to Z* as in (fTTI) : 
end for 

Calculate predictor function parameter matrix W from flf, 
a7l?=i and ©. 

Output: The orthonormal transformation matrices flj and 
127 and predictor function parameter matrix W. 


min 
Q/ ,C2t 


Y a,7jTr (nJl^yY^j^r) 

ij'=l 


n 

-2C2^Tr (127 j7t,12t) 
2=1 



s.t. QJQi — Idxd,I^I^T — Idxd- 


III. Experiments 

In this section, we will investigate the proposed algorithm 
experimentally. 

A. Experiment setup 

1) Data sets: In this experiment, we used two different 
\ \document data sets composed of images and texts. The first 
2 C 2 ^/ 7 rJl 2 T data set is the TVGraz database ED, which is a multimodal 
»=i / / database of object categories composed of textual and visual 

features. The documents belongs to 10 of 256 classes of 
(14) Caltech-256. 1,000 webpages are retrieved for each of the 
10 classes, and 2,058 image-text pairs are collected. Each 


The equivalent problem is 





image-text pair is linked to a document, thus we have 2,058 
documents of 10 classes in this data set. 

The seconde data set is the Wikipedia database, which is se¬ 
lected from Wikipedia featured article database, and Wikipedia 
featured article database contains documents of 30 classes. 
Because most of the classes of Wikipedia featured articles 
database contains very few documents, we only choose the 
10 classes with the most documents. Moreover, each featured 
article usually have more than one image and section, so we 
split each featured article to several documents. Each document 
contains a section of a featured article, and the images placed 
to this section. We have in total 7,114 documents. Moreover, 
we remove the documents which have more than one image, 
and the documents which have a text with less than 70 words. 
Finally, we have 2,866 documents in total in our experiment. 

The images in the documents are represented as feature 
vectors using the bag-of-features method, while the texts in the 
documents are represented as feature vectors using the bag-of- 
words method. 

2) Experiment protocol: To conduct the experiment, we 
used the 10-fold cross validation strategy. The entire data set 
is split to 10 folds randomly, and each fold was used as a 
test set in turn, while the remaining 9 sets were combined 
and used as a training set. The proposed learning algorithm 
was performed to the training set to learn the orthonormal 
transformation matrices and the predictor parameter matrix, 
and then they are used to represent and classify the individual 
images and texts in the test set. The classification accuracy is 
measured by the classification rate as follows. 


classification rate 

number of correctly classified images and texts 
total number of test images and texts 

(18) 


B. Results 


TVGraz Wikipedia 



SupCFA CFA_ker CFA SupCFA CFA_ker CFA 


(a) TVGraz 


(b) Wikipedia 


Fig. 1. Boxplots of classification rates of lO-fold cross validation of different 
CFA methods. 



Fig. 2. Convergency curve. 


1) Comparison to unsupervised CFA: We first compare the 
proposed supervised version of CFA against unsupervised CFA 
methods on the problem of image/text classification problem. 
We considered the original CFA method ifTSl and its kernel 
version (CFAfeer) M as data representation methods, and 
used a SVM as a classifier. The boxplots of classification 
rates of the 10-fold cross validations of the compared methods 
over two data sets are given in Fig. [T] It is clear that the 
proposed SupCFA outperforms the two unsupervised CFA 
methods completely. The low quartile of the SupCFA clas¬ 
sification rates is even higher than the upper quartiles of the 
tow compared methods. This is not surprising at all because 
SupCFA is the only method which can explore the supervision 
information to improve the discriminative ability of cross- 
modal factor analysis, while CFA and CFA^er ignore the 
class label information at all. Moreover, it seems that CFAfce^ 
outperforms CFA due to its usage of kernel tricks. 

2 ) Algorithm convergency: Since the proposed algorithm ia 
an iterative algorithm, it is important to study its convergency 
over iterations. We plot the curve of objective function over 
different iterations in Fig.|2l It could be seen that the objective 
function is reduced significantly in the first 30 iterations, and 
it tends to converge after the 30-th iteration. 

IV. Conclusions and Future works 

In this paper, we proposed the first supervised CFA method 
for the presentation and classification of multiple modal data. 
The proposed method not only projects data of different 
modals to a shared data space like CFA, but also tries to 
learn a predictor to predict the class labels from this data 
space. Moreover, the learning of projection and class label 
prediction parameters are learned within a single objective 
function. By optimizing this objective function with regard to 
both projection and class label prediction parameters jointly, 
the class label information is used to guild the learning of CFA 
parameters. The experiment results show that the supervised 
CFA outperforms both linear and kernel versions of CFA 
without considering class label information. We also plan to 
explore potential of proposed algorithm using big data analysis 
and high performance computing Il20l . IJT), ll22ll . Il2^ . Il24ll . 

H, Cini, Ga, M, Eli, ED, ED, El, ED, EQl, for 

applications of big multimodal data processing E21, EH, 
13, ESI, ESI, Ell, information and network security ESI, 










































ESI, iOl, ED, Sa, ED, El, El, El, E3, computer 
vision El, El, El, El, ED, El, El, ES, El, and 
bioinformatics El, El, El, EQ), ED, El, El, El, El, 
El, El, El, El, El, GqI, El, El- 

Acknowledgements 

The research reported in this publication was supported by 
competitive research funding from King Abdullah University 
of Science and Technology (KAUST), Saudi Arabia. 

Reeerences 

[1] F. Carenzi, P. Bendahan, V. Roschin, A. Frolov, P. Gorce, and M. Maier, 
“A generic neural network for multi-modal sensorimotor learning,” 
Neurocomputing, vol. 58-60, pp. 525-533, 2004. 

[2] A. Lumini and L. Nanni, ‘An advanced multi-modal method for human 
authentication featuring biometrics data and tokenised random numbers,” 
Neurocomputing, vol. 69, no. 13-15, pp. 1706-1710, 2006. 

[3] K.-S. Lee, A. Nurzid Rosli, I. Ariesthea Supandi, and G.-S. Jo, “Dynamic 
sampling-based interpolation algorithm for representation of clickable 
moving object in collaborative video annotation,” Neurocomputing, vol. 
146, pp. 291-300, 2014. 

[4] P. Szymczyk and M. Szymczyk, “Classification of geological structure 
using ground penetrating radar and laplace transform artificial neural 
networks,” Neurocomputing, vol. 148, pp. 354-362, 2015. 

[5] D. Merkl, “Text classification with self-organizing maps: Some lessons 
leai'ned,” Neurocomputing, vol. 21, no. 1-3, pp. 61-77, 1998. 

[6] H.-J. Kim, J.-U. Kim, and Y.-G. Ra, “Boosting naive bayes text clas¬ 
sification using uncertainty-based selective sampling,” Neurocomputing, 
vol. 67, no. 1-4 SUPPL., pp. 403-410, 2005. 

[7] J. Caicedo, J. BenAbdallah, F. Gonzalez, and O. Nasraoui, “Multimodal 
representation, indexing, automated annotation and retrieval of im¬ 
age collections via non-negative matrix factorization,” Neurocomputing, 
vol. 76, no. 1, pp. 50-60, 2012. 

[8] C. Hong and J. Zhu, “Hypergraph-based multi-example ranking with 
sparse representation for transductive learning image retrieval,” Neuro¬ 
computing, vol. 101, pp. 94-103, 2013. 

[9] F. Zhang, Y. Zhang, and J. D. Bakos, “Accelerating frequent itemset 
mining on graphics processing units,” The Journal of Supercomputing, 
vol. 66, no. 1, pp. 9^117, 2013. 

[10] F. Zhang, Y. Zhang, and J. Bakos, “Gpapriori: Gpu-accelerated frequent 
itemset mining,” in Cluster Computing (CLUSTER), 2011 IEEE Inter¬ 
national Conference on, 2011, pp. 590-594. 

[11] J. Costa Pereira, E. Coviello, G. Doyle, N. Rasiwasia, G. Lanckriet, 
R. Levy, and N. Vasconcelos, “On the role of con'elation and abstraction 
in cross-modal multimedia retrieval,” Pattern Analysis and Machine 
Intelligence, IEEE Transactions on, vol. 36, no. 3, pp. 521-535, March 
2014. 

[12] Y. Chen, L. Wang, W. Wang, and Z. Zhang, “Continuum regression 
for cross-modal multimedia retrieval,” in 2012 19th IEEE International 
Conference on Image Processing (ICIP 2012), 2012, pp. 1949 - 52. 

[13] D. Li, N. Dimitrova, M. Li, and I. K. Sethi, “Multimedia content 
processing through cross-modal association,” in Proceedings of the 
eleventh ACM international conference on Multimedia. ACM, 2003, 
pp. 604-611. 

[14] Y. Wang, L. Guan, and A. N. Venetsanopoulos, “Kernel cross-modal 
factor analysis for multimodal information fusion,” in Acoustics, Speech 
and Signal Processing (ICASSP), 2011 IEEE International Conference 
on. IEEE, 2011, pp. 2384-2387. 

[15] F. Liu, G. Yang, Y. Yin, and S. Wang, “Singular value decomposition 
based minutiae matching method for finger vein recognition,” Neuro¬ 
computing, vol. 145, pp. 75-89, 2014. 

[16] X. Zhang, Z. Xu, N. Jia, W. Yang, Q. Feng, W. Chen, and Y. Feng, 
“Denoising of 3d magnetic resonance images by using higher-order 
singular value decomposition,” Medical Image Analysis, vol. 19, no. 1, 
pp. 75-86, 2015. 

[17] P. Miao, Y. Shen, and X. Xia, “Finite time dual neural networks with a 
tunable activation function for solving quadratic programming problems 
and its application,” Neurocomputing, vol. 143, pp. 80-89, 2014. 

[18] F. Fomeni and A. Letchford, “A dynamic programming heuristic for the 
quadratic knapsack problem,” INFORMS Journal on Computing, vol. 26, 
no. 1, pp. 173-182, 2014. 


[19] I. Khan, A. Saffari, and H. Bischof, “Tvgraz: Multi-modal learning of 
object categories by combining textual and visual features,” in AAPR 
Workshop, 2009, pp. 213-224. 

[20] K. Wang, X. Zhou, K. Qiao, M. Lang, B. McClelland, and I. Raicu, 
“Towards scalable distributed workload manager with monitoring-based 
weakly consistent resource stealing,” in Proceedings of the 24rd in¬ 
ternational symposium on High-performance parallel and distributed 
computing. ACM, 2015, pp. 219-222. 

[21] F. Zhang, Y. Gao, and J. D. Bakos, “Lucas-kanade optical flow estimation 
on the ti c66x digital signal processor,” in High Performance Extreme 
Computing Conference (HPEC), 2014 IEEE, 2014, pp. 1-6. 

[22] Y. Gao, F. Zhang, and J. D. Bakos, “Sparse matrix-vector multiply on 
the keystone ii digital signal processor,” in High Performance Extreme 
Computing Conference (HPEC), 2014 IEEE, 2014, pp. 1-6. 

[23] Y. Zhang, F. Zhang, Z. Jin, and J. D. Bakos, “An fpga-based accelerator 
for frequent itemset mining,” ACM Transactions on Reconfigurable 
Technology and Systems (TRETS), vol. 6, no. 1, p. 2, 2013. 

[24] Y. Zhang, F. Zhang, and J. Bakos, “Frequent itemset mining on large- 
scale shared memory machines,” in Cluster Computing (CLUSTER), 
2011 IEEE International Conference on, 2011, pp. 585-589. 

[25] T. Li, X. Zhou, K. Brandstatter, D. Zhao, K. Wang, A. Rajendran, 
Z. Zhang, and I. Raicu, “Zht: A light-weight reliable persistent dynamic 
scalable zero-hop distributed hash table,” in Parallel & Distributed 
Processing (IPDPS), 2013 IEEE 27th International Symposium on, 2013, 
pp. 775-787. 

[26] D. Zhao, Z. Zhang, X. Zhou, T. Li, K. Wang, D. Kimpe, R Cams, 
R. Ross, and I. Raicu, “Fusionfs: Toward supporting data-intensive 
scientific applications on extreme-scale high-performance computing 
systems,” in Big Data (Big Data), 2014 IEEE International Conference 
on. IEEE, 2014, pp. 61-70. 

[27] T. Li, X. Zhou, K. Brandstatter, and I. Raicu, “Distributed key-value 
store on hpc and cloud systems,” in 2nd Greater Chicago Area System 
Research Workshop (GCASR), 2013. 

[28] K. Wang, A. Kulkami, X. Zhou, M. Lang, and I. Raicu, “Using 
simulation to explore distributed key-value stores for exascale system 
services,” in 2nd Greater Chicago Area System Research Workshop 
(GCASR), 2013. 

[29] K. Wang, X. Zhou, T. Li, D. Zhao, M. Lang, and I. Raicu, “Optimizing 
load balancing and data-locality with data-aware scheduling,” in Big 
Data (Big Data), 2014 IEEE International Conference on, 2014, pp. 
119-128. 

[30] K. Wang, X. Zhou, H. Chen, M. Lang, and I. Raicu, “Next generation 
job management systems for extreme-scale ensemble computing,” in 
Proceedings of the 23rd international symposium on High-performance 
parallel and distributed computing, 2014, pp. 111-114. 

[31] K. Wang, N. Liu, I. Sadooghi, X. Yang, X. Zhou, M. Lang, X.-H. Sun, 
and I. Raicu, “Overcoming hadoop scaling limitations through distributed 
task execution,” in Proc. of the IEEE International Conference on Cluster 
Computing 2015 (ClusterlS), 2015. 

[32] H. Wang and J. Wang, ‘An effective image representation method using 
kernel classification,” in Tools with Artificial Intelligence (ICTAI), 2014 
IEEE 26th International Conference on, Nov 2014, pp. 853-858. 

[33] Y. Wang, W. Jiang, and G. Agrawal, “Scimate: A novel mapreduce- 
like framework for multiple scientific data formats,” in Cluster, Cloud 
and Grid Computing (CCGrid), 2012 12th lEEE/ACM International 
Symposium on, 2012, pp. 443-450. 

[34] Y. Wang, Y. Su, and G. Agrawal, “Supporting a light-weight data 
management layer over hdf5,” in Cluster, Cloud and Grid Computing 
(CCGrid), 2013 13th lEEE/ACM International Symposium on. IEEE, 
2013, pp. 335-342. 

[35] Y. Wang, A. Nandi, and G. Agrawal, “Saga: array storage as a db with 
support for structural aggregations,” in Proceedings of the 26th inter¬ 
national conference on scientific and statistical database management. 
ACM, 2014, p. 9. 

[36] J. J.-Y. Wang, Y. Wang, S. Zhao, and X. Gao, “Maximum mutual infor¬ 
mation regulaiized classification,” Engineering Applications of Artificial 
Intelligence, vol. 37, pp. 1-8, 2015. 

[37] Y. Wang, G. Agrawal, G. Ozer, and K. Huang, “Removing sequential 
bottlenecks in analysis of next-generation sequencing data,” in Parallel 
& Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE 
International. IEEE, 2014, pp. 508-517. 

[38] S. Xu, W. Lu, L. Xu, and Z. Zhan, “Adaptive epidemic dynamics in 
networks: Thresholds and control,” ACM Trans. Auton. Adapt. Syst., 
vol. 8, no. 4, pp. 19:1-19:19, Jan. 2014. 


[39] S. Xu, W. Lu, and L. Xu, “Push- and pull-based epidemic spreading in 
networks: Thresholds and deeper insights,” ACM Trans. Auton. Adapt. 
Syst, vol. 7, no. 3, pp. 32:1-32:26, Oct. 2012. 

[40] L. Xu, Z. Zhan, S. Xu, and K. Ye, “An evasion and counter-evasion 
study in malicious websites detection,” in Communications and Network 
Security (CNS), 2014 IEEE Conference on, 2014, pp. 265-273. 

[41] Z. Zhan, M. Xu, and S. Xu, “Characterizing honeypot-captured cyber 
attacks: Statistical framework and case study,” Information Eorensics and 
Security, IEEE Transactions on, vol. 8, no. 11, pp. 1775-1789, 2013. 

[42] L. Xu, Z. Zhan, S. Xu, and K. Ye, “Cross-layer detection of malicious 
websites,” in Proceedings of the third ACM conference on Data and 
application security and privacy. ACM, 2013, pp. 141-152. 

[43] Q. Sun, P. Wu, Y. Wu, M. Guo, and J. Lu, “Unsupervised multi-level 
non-negative matrix factorization model: Binary data case,” Journal of 
Information Security, vol. 3, no. 04, p. 245, 2012. 

[44] Q. Sun, F. Hu, Y Wu, and X. Huang, “Primate-inspired adaptive routing 
in intermittently connected mobile communication systems,” Wireless 
Networks, vol. 20, no. 7, pp. 1939-1954, 2014. 

[45] Q. Sun, W. Yu, N. Kochurov, Q. Hao, and F. Hu, “A multi-agent-based 
intelligent sensor and actuator network design for smart house and home 
automation,” Journal of Sensor and Actuator Networks, vol. 2, no. 3, pp. 
557-588, 2013. 

[46] Q. Sun, F. Hu, and Q. Hao, “Human movement modeling and activity 
perception based on fiber-optic sensing system,” Human-Machine Sys¬ 
tems, IEEE Transactions on, vol. 44, no. 6, pp. 743-754, 2014. 

[47] -, “Mobile target scenario recognition via low-cost pyroelectric 

sensing system: Toward a context-enhanced accurate identification,” 
Systems, Man, and Cybernetics: Systems, IEEE Transactions on, vol. 44, 
no. 3, pp. 375-384, 2014. 

[48] J. J.-Y Wang, H. Bensmail, and X. Gao, “Joint learning and weighting of 
visual vocabulary for bag-of-feature based tissue classification,” Pattern 
Recognition, vol. 46, no. 12, pp. 3249-3255, 2013. 

[49] -, “Multiple graph regularized nonnegative matrix factorization,” 

Pattern Recognition, vol. 46, no. 10, pp. 2840-2847, 2013. 

[50] J. J.-Y Wang and X. Gao, “Beyond cross-domain learning: Multiple- 
domain nonnegative matrix factorization,” Engineering Applications of 
Artificial Intelligence, vol. 28, pp. 181-189, 2014. 

[51] Q. Sun, J. Lu, Y. Wu, H. Qiao, X. Huang, and F. Hu, “Non-informative 
hierarchical bayesian inference for non-negative matrix factorization,” 
Signal Processing, vol. 108, pp. 309-321, 2015. 

[52] J. J.-Y. Wang, H. Bensmai[, and X. Gao, “Feature selection and multi- 
kernel learning for sparse representation on a manifold,” Neural Net¬ 
works, vol. 51, pp. 9-16, 2014. 

[53] J.-Y. Wang, 1. Almasri, and X. Gao, “Adaptive graph regularized nonneg¬ 
ative matrix factorization via feature selection,” in Pattern Recognition 
(ICPR), 2012 21St International Conference on. IEEE, 2012, pp. 963- 
966. 

[54] J. J.-Y. Wang, H. Bensmai[, N. Yao, and X. Gao, “Discriminative sparse 
coding on multi-manifolds,” Knowledge-Based Systems, vol. 54, pp. 199- 
206, 2013. 

[55] J. Luo and A. Brodsky, “Piecewise surface regression modeling in intel¬ 
ligent decision guidance system,” in Intelligent Decision Technologies. 
Springer, 2011, pp. 223-235. 

[56] -, “Piecewise regression learning in corejava framework,” Interna¬ 

tional Journal of Machine Learning and Computing, vol. 1, no. 2, pp. 
163-169, 2011. 

[57] Y. Wang, T. Yang, Y. Ma, G. V. Halade, J. Zhang, M. L. Lindsey, and 
Y-F. Jin, “Mathematical modeling and stability analysis of macrophage 
activation in left ventricular remodeling post-myocardial infarction,” 
BMC genomics, vol. 13, no. Suppl 6, p. S21, 2012. 

[58] Y. Wang, H.-C. Han, J. Y. Yang, M. L. Lindsey, and Y. Jin, “A 
conceptuai ceUular interaction model of left ventricular remodelling post- 
mi: dynamic network with exit-entry competition strategy,” BMC systems 
biology, vol. 4, no. Suppl 1, p. S5, 2010. 

[59] T. Yang, Y. A. Chiao, Y. Wang, A. Voorhees, H.-C. Han, M. L. Lindsey, 
and Y.-F. Jin, “Mathematicai modeling of left ventricular dimensional 
changes in mice during aging,” BMC systems biology, vol. 6, no. Suppl 
3, p. SIO, 2012. 

[60] J. Hu and F. Zhang, “Improving protein localization prediction using 
amino acid group based physichemical encoding,” in Bioinformatics and 
Computational Biology, 2009, pp. 248-258. 

[61] F. Zhang and J. Hu, “Bioinformatics analysis of physicochemical prop¬ 
erties of protein sorting signals,” 2010. 


[62] -, “Bayesian ciassifier for anchored protein sorting discovery,” in 

Bioinformatics and Biomedicine, 2009. BIBM’09. IEEE International 
Conference on, 2009, pp. 424—428. 

[63] M. Bhuyan and X. Gao, “A protein-dependent side-chain rotamer li¬ 
brary.” BMC bioinformatics, vol. 12 Suppl 14, p. SIO, 2011. 

[64] J. J.-Y. Wang, H. Bensmai[, and X. Gao, “Multip[e graph regu[arized 
protein domain ranking,” BMC bioinformatics, voi. 13, no. 1, p. 307, 
2012 . 

[65] Y. Liu, J. Yang, Y. Zhou, and J. Hu, “Structure design of vascuiar stents,” 
Multiscale simulations and mechanics of biological materials, pp. 301- 
317, 2013. 

[66] Y. Zhou, W. Hu, B. Peng, and Y. Liu, “Biomarker binding on an 
antibody-functionalized biosensor surface: the influence of surface prop¬ 
erties, electric field, and coating density,” The Journal of Physical 
Chemistry C, vol. 118, no. 26, pp. 14586-14594, 2014. 

[67] J. J.-Y. Wang, X. Wang, and X. Gao, “Non-negative matrix factorization 
by maximizing correntropy for cancer clustering,” BMC bioinformatics, 
vol. 14, no. 1, p. 107, 2013. 

[68] J. Wang, Y. Li, Q. Wang, X. You, J. Man, C. Wang, and X. Gao, 
“Proclusensem: predicting membrane protein types by fusing different 
modes of pseudo amino acid composition,” Computers in biology and 
medicine, vol. 42, no. 5, pp. 564-574, 2012. 

[69] J. Wang, X. Gao, Q. Wang, and Y. Li, “Prodis-contshc: [earning protein 
dissimilarity measures and hierarchical context coherently for protein- 
protein comparison in protein database retrieval,” BMC bioinformatics, 
vol. 13, no. Suppl 7, p. S2, 2012. 

[70] L. Dai, X. Gao, Y. Guo, J. Xiao, Z. Zhang et ai, “Bioinformatics clouds 
for big data manipulation,” Biology direct, vol. 7, no. 1, p. 43, 2012. 

[71] B. Peng, Y. Liu, Y. Zhou, L. Yang, G. Zhang, and Y. Liu, “Modeling 
nanoparticle targeting to a vascular surface in shear flow through 
diffusive particle dynamics,” Nanoscale Research Letters, vol. 10, no. 1, 
p. 235, 2015. 

[72] S. Wang, Y. Zhou, J. Tan, J. Xu, J. Yang, and Y. Liu, “Computational 
modeling of magnetic nanoparticle targeting to stent surface under high 
gradient field,” Computational mechanics, vol. 53, no. 3, pp. 403^12, 
2014. 



